
X09Rec'dPCT/PT0 OSJUNaHB. 




WO 2004/050693 



PCT/GB2003/005256 



ENGINEERING REDOX PROTEINS 



This invention relates to engineering proteins so that they have the required 
properties to carry out a particular function. 



engineered protein structures capable of the same specific activities as their 
natural counterpart, but without the complexity associated with cellular functions. 
This has been addressed either by the de novo synthesis of peptides using 
specified templates, or by mutating wild type redox proteins or more recently by 
1 0 modular assembly of existing proteins. 

Protein engineering work has been carried out on many proteins including 
the E.coli repressor of primer (rop) a regulatory protein which binds nucleic acid 
and has a 4-a-heIix bundle motif. 



15 4-a-helix bundle is typified by an association of 4-a-helices that cross at an angle 
of about 20''. The helices may be linked by loops which can vary in their length, 
sequence and connectivity. The basic 4-helix bundle framework can support a 
diverse variety of activities, from oxygen transport in myohemerythrin to receptor 
binding in haematopoietic grov\4h factors and RNA binding in rop. 

2 0 Wild type rop is dimeric, each monomer being formed by two helices 

connected by a sharp bend. Predki and Regan (1995) describe producing a single 
chain rop. Their method separates helix 1 and 2 sequences of one rop molecule 
and places helix 1 before helix V and places helix 2 after helix 2* of a second rop 
(consisting of helices 1' and 2'). The monomeric rop contains 4 a-helices in the 

25 order 1-1-2-2 and ail a helices are expressed as a single polypeptide chain. This 
method of ordering the 4 a helices maintains the wild type N- and C-termini and the 
connection between helix 1' and 2\ 



3 0 primer) and 

(b) a redox centre. 

The inventors have provided a stable protein capable of carrying out redox 
activities and other properties of the protein can be controlled. 



5 



A very attractive goal for chemistry and biology is the creation of 



The 4-a-helix bundle is a commonly occurring, stable, structural motif. The 



(a) 



This invention provides a protein comprising: 

a 4-a-helix bundle motif formed from the a-helices of rop (repressor of 
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This invention takes an existing simple and stable protein fold as a structure 
in which to insert new functions. This approach mimics the natural process of 
protein evolution, where within a protein superfamily with a particular structure a 
diversity of function can be found. The protein of this invention Is based on the 
5 well-studied and understood 4-helix-bundle protein fold. 

The protein of the current invention comprises a redox centre. A redox 
centre is an atom, group of atoms or moiety capable of accepting and donating 
electrons. The redox centre may be a metal atom such as iron or copper which is 
stable in two different oxidation states. The redox centre may be an iron sulphur 

10 centre which generally comprise two or four iron atoms bound to an equal number 
of sulphur atoms and to cysteine side chains. The redox centre may also be a 
haem group which comprises an Iron atom held in the middle of a porphyrin 
structure. Alternatively the redox centre may not contain a metal and may be an 
organic redox centre such as FMN or FAD. 

15 The redox centre may be coordinated to the protein structure formed from 

the Q-helices of rop. This may be the case when the redox centre is metal such, as 
an iron or copper atom. Other redox centres such as haem are associated to the 
Q-helices of the protein scaffold. Residues such as histidlne, leucine, methionine 
and cysteine of the a-helices In the protein structure can be used to co-ordinate the 

20 iron atom of the haem as axial iigands when the redox centre is haem. 

Residues suitable to coordinate to the redox centre are Introduced into the 
helices of rop, and any residues present in the wild type sequence ttiat could cause 
bonding to occur at an inappropriate place are removed, by mutation of the 
polypeptide sequence. 

25 Simple natural cytochromes have reported mid-point potentials covering the 

range of -475 to +310 mV. The redox midpoint potential of the protein of this 
invention lies in the range -485 to +320 mV. The midpoint potential can be altered 
depending on the use of the redox protein. A major determinant of the redox 
. potential of haem proteins is thought to be haem exposure. In addition other 

3 0 factors have also been implicated including the polarity of the haem environment, 
the strength of the axial ligand field, and the electrostatic interactions between 
haem and its propionates. (Stellwagen (1978; Kassner, R.J. (1973); Valentino J.S. 
et al (1979) & Moore, G.R. (1983)). It has also been demonstrated that the redox 
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potential of cytochrome bggz can be varied by over lOOmV by changing the amino 
acids in the vicinity of the haem, (Springs, S.L. et al (2000)). 

A second aspect of this invention provides a method of producing the 
protein of this invention. Random mutagenesis or site directed mutagenesis is 
5 performed to engineer the protein such that it contains the necessary residues to 
enable redox centre binding to the protein. Therefore the DNA sequence which 
encodes the protein of this invention, either isolated or incorporated into a vector is 
also part of the invention. This sequence is then expressed in and the polypeptide 
purified from a microorganism. (Alternatively it is possible that the polypeptide can 

10 be produced using a solid phase peptide synthesis). The resulting polypeptide. 

which is capable of binding a redox centre, is then incubated with an excess of that 
redox centre to ensure incorporation into the protein. 

A third aspect of this Invention involves the use of the engineered protein. 
The protein of the current invention may be used as an electron earner either on its 

15 own or in a series of electron carriers. A series of electron earners may be an 

electron transport chain, or it may include an electrode. An electron transfer chain 
transports electrons from a higher to a lower energy level along a series of electron 
earner molecules. An electron earner molecule is a molecule that transfers an 
electron from a donor molecule to an acceptor molecule. An electron acceptor is a 

2 0 molecule that takes up electrons easily, thereby gaining an electron and becoming 

reduced, whereas an electron donor is a molecule that easily gives up an electron, 
becoming oxidised in the process. Therefore the protein of this invention can be 
used in a method which involves passing electrons along a sequence of electron 
earners, in which each electron carrier is reduced and then oxidised (or vice versa) 
25 by electron movement and the sequence of electron carriers; includes ttie protein 
of this invention. The electron transport chain may comprise natural or synthetic 
electron carriers. The protein of this invention can be used in a method involving 
passing electrons along a sequence of electron earners, in which each electron 
earner is reduced and then oxidised or vice versa by electron movement and the 

3 0 protein of this Invention fonns part of the sequence of electron earners. In such a 

method electrons are generally moved along a gradient of electron carriers with 
successively lower or higher redox potential. 

An alternative use of the protein of this Invention is as part of an apparatus 
comprising the protein assodated with an electrode in a manner that electrons may 
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be passed from one to the next. The protein may be bound or adsorbed onto the 
electrode. This includes use of the protein of this invention in cyclic voltammetry, 
which is used to provide information about the redox protein's mid-point potential. 
As has been mentioned above the 4-a-helix bundle of rop protein forms a 
5 stable structure. The redox protein of the present invention is also stable and has 
an unfolding free energy when denaturant is added to the protein of AGobs^a^ 
wherein y^3.0 kcal/mol. The redox protein of this invention contains mutations 
which allow bonding of the redox centre. In addition to incorporation of a residue 
or residues for bonding to the redox centre a aromatic amino acids such as 
10 tryptophan can be inserted into the protein. Insertion into the protein staicture of 
aromatic amino acids provides a method of spectrophotometric detection of the 
protein. It is necessary to ensure that the insertion of residues allowing 
spectrophotometric detection does not destabilise the protein. 

15 Brief description of the figures 

Figure 1 shows ribbon diagram representations of rop. The 3D models for 
monomeric rop were derived from the wild type dimeric rop model using the 
BIOPOLYMER and DISCOVERS modules within Insight IL 
Figure 2 shows the absoriDance spectra of a bis-Histidine haem binding rop. The 

20 data were collected on a Hewlett-Packard 8453 UV-visible photodiode array 
spectrophotometer. 

Figure 3 shows the electrochemical properties of the rop haem protein. 
Figure 4 shows the DNA and protein sequence of wild type dimeric rop with the a- 
helix regions shown in boxes. Also shown are psp7 amplification upstream 
25 sequence and asp4 amplification downstream sequence. 

Figure 5 shows the DNA and protein sequence of monomeric rop <S55) (created by 
Predki & Regan) with the 

a-helix regions highlighted. Also shown are psp 7 amplification upstream 
sequence and asp4 amplification downstream sequence. 
3 0 Figure 6 shows a protein sequence of a redox protein of the present invention 
formed from the a-helices of rop, produced according to example 1. 

In the preferred type of protein of this Invention the 4-a-helix bundle 
structure used to develop the proteins of this Invention is monomeric rop as 
described above. This protein is developed from the wild type dimeric rop which is 
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shown in Figure 4 and sequence ID Nos 5 and 6. In Figure 4 the portions of the 
sequences enclosed in boxes are the a-helices. in the momomeric protein all 4 
helices are expressed as a single polypeptide chain. Monomeric rop is shown in 
Figure 5 and sequence ID Nos 7 and 8. In Figure 5 the portions of the sequence 

5 enclosed in boxes are the a-helices. When specific residues are referred to the 
numbering system relates to the monomeric rop developed previously and shown 
in Figure 5. A structure for monomeric rop was generated by homology modeling 
using the program Insight II (MSI, San Diego) using the NMR structure as a 
template. This is shown in Figure 1 which shows the design of monomeric rop. In 

0 this embodiment native dimeric rop (left) was engineered into a single polypeptide 
chain monomeric variant (right) facilitated by the introduction of two 6 x Gly loops 
(Predki & Regan. 1995). The native His residues (76, 78. 107. 109) are shown in 
stick representation. In the protein of the present invention these His residues are 
removed as described below and residues necessary for interaction with the redox 

5 centre are introduced into the polypeptide. Removed means a particular amino 
acid is no longer at that position. Preferably this is achieved via substitution, 
however deletion may be a possibility. Introduced means a particular amino acid is 
now found at that position. Preferably this is achieved via subtitutlon, however 
insertion may also be a possibility. Monomeric rop provides a better template for 

) engineering than its dimeric precursor because of the ability to introduce single 
mutations. 

As can be seen from Figure 5 the first two helices and the second two 
helices of monomeric rop have the same sequence. The helix secondary structure 
may start one residue eariier in helix 4 (with A) than helix 3 because the GGGGG 
loop preceding it is long enough to allow reorientation, whilst the a-helix secondary 
structure in helix 3 probably begins at D as only one G is present between helices 
2 and 3, therefore A may still be part of the loop region. 

In the mutated monomeric rop proteins of this invention the a-helix regions 
are generally conserved and preferably the helices of ttie 4-a-helix bundle of the 
protein of this invention show 60% or 70%, preferably 80 similarity or identity witti 
Oie a-helix regions indicated in Figures 4 and 5 and shown in sequence ID No. 1. 
2. 3 and 4. The. variation is generally introduced by removing native His residues 
76. 78. 107 and 109 as they could lead to inappropriate binding of a redox cenfre, 
or by including residues required to incorporate the redox centre into the protein or 
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by altering residues in the vicinity of the redox centre. More variation may be 
introduced by residues which allow detection of the protein, for example including 
aromatic amino acids such as tryptophan. The loops between each a-helix may. 
however, vary significantly (from that shown in Figure 5 and sequence ID Nos 7 
5 and 8) in terms of their length, sequence and connectivity, provided that the loop 
regions allow the a-helices to come together in 3D space to form the 4-a-helix 
bundle. 

The terms as used above, "similarity and "identity*' are known in the art. 
The use of the term "identity" refers to a sequence comparison based on identical 

10 matches between con-espondingly indentical positions in the sequences being 
compared. The term "similarity" refers to a comparison between amino acid 
sequences, and takes into account not only identical amino acids in corresponding 
positions, but also functionally similar amino acids in corresponding positions. 
Thus similarity between polypeptide sequences indicates functional similarity, in 

15 addition to sequence similarity. 

Levels of identity between gene sequences and levels of identity or 
similarity between amino acid sequences can be calculated using known methods. 
In relation to the present invention, publicly available computer based methods for 
determining identity and similarity include the BLASTP. BLASTN and FASTA 

2 0 (Atschul ef a/.. J, Molec. Biol., 1990; 215:403-410), the BLASTX program available 
from NCBI, and the Gap program from Genetics Computer Group, Madison Wl. 

A more preferred protein of this invention is based on monomeric rop as 
described above and contains the following mutations, the histidines at H76 H78 
HI 07 and H109 are substituted for other residues. In one embodiment H76A and 

25 H107A substitutions create a binding site for a haem group. In addition Oie 

mutations H78W and H109W may occur to create a means of spectrophotometric 
detection of \he novel redox protein. Furtiier mutations may also be incorporated 
into this protein for example L56H and L1 13H. The sequence of the espedally 
preferred redox protein with mutations H76A. H78W, H107A, H109W and L56H 

30 and L113H is given in sequence ID No. 11 and in Figure 6. 

All the proteins of tiiis invention are stable and have an urifolding free 
energy as described above. The stability of the proteins of this invention is 
comparable with or greater than the stability of wild type rop itself. 
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Example 1 Production of monomeric rop possessing a haem group. 
The monomeric rop constmcts, (S55). was the starting point for the rational 
design strategy (Munson et al 1994; PF Predki & Regan. 1995). Site directed 
mutagenesis was performed either by the "Megaprimer method" or using a method 
5 based on the "Quik Change" method (Stratagene Europe) (Trade mark)in which a 
plasmid is amplified using two complementary oligonucleotides containing the 
desired mutation. DNA manipulations were cam'ed out in E.co// DH5 a or £-co// 
XLI-Blue (stratagene Europe). When the "IWIegaprimer method" was used, the rop 
gene was amplified by PGR using flanking primers psp7 and asp4 (sequence ID 
10 No 9 and 1 0 respectively) and mutations were confirmed by sequencing of the 

PGR product. Oligonucleotides were obtained from Amersham Pharmacia Biotech 
(Table 1). 

Table 1. Oligonuleotides used to engineer the haem binding mutants. 



15 Name Olifionucleotide Forward Oligonucleotide Reverse 

asp4 gctcagcggtggcagcagccaac 
H76A,H78A gaatcgct^ccgacgccgc^alg 
H76A,H78W gaatcgcttgctgactgggctgat 

20 H107A,H109W 

L56H ctcagacattaacgcacctagagaagcttaacg gagctcgttaagcttctctaggtgcgttaatgtctgag 

agctc 

L113H cgccgctgatgagcattaccgcagctgcc ggcagctgcggtaatgctcatqagcggcg 

L63M gaagcttaacgagatgggggcggatgaacagg cctgttcatccgcccccatctcgttaagcttc 

P^^^H agctgccttgc ccgtcacggcgacgacggtg caccgtcgtcgccgtgacgggcaa^cagct 

25 

The initial step was to remove potential non-specific ligands by the 
replacement of endogenous histidine residues creating the mutant H76A. H78W, 
H107A. H109W, see Figure 1A. A haem binding site was created In this protein by 
the further mutations L56H, L113H, see Figure 1B. Figure 1B shows the design of 

3 0 haem binding sites in monomeric rop. A monomeric rop variant (left) lacking the 
native His residues (H76A, H78W. H107A, H109\A/) was produced by site directed 
mutagenesis. Mutations L56H, L1 13H were incorporated to introduce haem 
ligands. A sequential approach was taken to incorporate multiple mutations into 
rop. Constructs leading to the creation of a haem-binding rop variant are listed in 

35 Table 2 below. The RDM14.5 construct, containing the (L56H. L113H) haem 
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binding site (rop-56H/113H) was made by.the subcloning of the appropriate 
fragments from RDI\/IlO-b4 and QC1 1-J1. 



Table 2. Rop constructs produced for this study with mutations from tiie S55 

sequence. 



35 



Construct Mutations 

S55 : ~ — — 

JWl H107A, H109A 

JWBsub H76A, H78A, H107A, H109A 

1 ° JW2 H76A, H78W, H107A, H109W 

JW7 H76A, H78W. H107A, H109A 

RDM2. 1 H76A, H78W, H107A^ H109W, L63H 

QC3.5 H76A H78W, H107A, H109W, L63H, F121H 

QC7.2 H76A. H78W, H107A, H109W, L63M, F121H 

^5 RDM10b4 H76A, H78W, H107A, H109W, L56H, G64R 

QCU-Jl H76A, H78W, H107A, H109W, LI 13H 

RDM14.5 H7 6A H78W. H107A, H109W. L56H, LI 13H 

The haem binding variant was heterologously expressed from the pMR103 

2 0 plasmid in E.cofi BL21 (ADE3) and purified to homogeneity by standard techniques. 

Haem was Incorporated into the apo-protein by addition of a 4-fold molar excess of 
hemin chloride solubilised in DMF. incubation overnight, and removal of free haem 
by iron exchange chromatography (DEAE sepharose FF. Amereham Pharmacia 
Biotech UK Ltd). The resulting haem binding rop protein has spectral properties 

2 5 characteristic of native b-type cytochromes. Figure 2 shows spectra of bls- 

Histidine haem bind rop variant In oxidized and reduced fonn. Solid line: 20mM 
haem binding monomeric rop variant L56H. L113H in 10mM sodium phosphate 
buffer, pH80. Dashed line: 20mM rop following reduction by sodium dithionite. The 
oxidized protein has a at 413 nm, which upon reduction with sodium dithionite 

30 shifts to 426 nm with distinct a and p peaks at 559 nm and 531 nm respectively. 

The spectral characteristics are typical of b-type cytochromes with a sbc coordinate 
low spin Iron, and this is a strong indication that the haem group has been 
incorporated into the helix bundle as predicted in the design. 



Example 2. Testing the r edox properties of this novel haem protein 
The redox properties of this novel haem protein rop-56H/113H were 
detennined by spectroelectrochemistry and cyclic voltammetry and are presented 
in Figure 3. Figure 3A shows redox titration of rop variant witti bound haem. 
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Titration of sodium dithionite into protein solution (40pl\/l) was performed 
anaerobically. Following equilibration the UV-visible spectrum was obtained 
(Hewlett-Packard 8453 spectrophotometer) and the potential nneasured 
simultaneously (Autolab PGSTAT10 potentiostat). Titrations were performed at 
5 20^*0 in pH 7.8 buffer. 

Figure 3B shows cyclic voltammograms of rop variant with bound haem. 
Cyclic voltammetry was performed on a bare glassy carbon electrode over a range 
of scan rates. 

Spectrophotometic titrations were performed in an anaerobic a quartz 
10 cuvette (Hellma (England) Limited UK) equipped with a platinum-mesh worldng and 
an Ag/AgCI reference electrode, according to the procedure described in Dutton, 
P.L., 1978. Ambient redox potentials were adjusted by addition of aliquots {<S\A) 
of sodium dithionite or potassium ferricyanide. Titrations were performed in 50mM 
potassium phoshate, pH 7.5. Electrode-solution mediation was facilitated by the 
15 following mediators: phenazine methosulphate (5 pM), duraquinone (5 \M), 2- 

hydroxy 1,4 napthoquinone (5 pM) benzyl viologen (2 pM), indigocarmine (0.5 pIM), 
resorufin (0.5 fjM). The modified cuvette was flushed with oxygen-free nitrogen 
before and during the measurements. After each addition of sodium dithionite (0.5 
|j| of a 10 mM solution) the protein was allowed to equilibrate and at each potential, 

2 0 the optical spectrum was recorded using a Hewlett-Packard 8452 diode array 

. spectrophotometer. The fraction of the reduced protein was estimated at each 
potential by following the shift in Soret peak (413 to 426 nm) and was plotted 
against potential. The data points were fitted to the Nemst equation with n=1: 

E =:Em +(RT/nF) In (tox]/[red]) 
25 where [ox] and [red] are the concentration of oxidised and reduced haem 

respectively, E is the solution redox potential at equilibrium, n is the number of 
electrons, Em is the mid-point potential, F is the Faraday constant, R is the gas 
constant and T is the absolute temperature. 

The redox titrations with sodium dithionite (Figure 3A) show one larger 

3 0 component (78%) at -154 ± 2 mV and one smaller (22%) at 17 ± 9 mV, relative to 

the normal hydrogen elertrode. The presence of two reduction potentials is likely to 
be due two non-equivalent orientations in the engineered rop. This is at present 
being investigated. Only one mid point potential of Em= -134 ±13 mV could be 
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measured using cyclic voltammetry on a bare glassy-carbon electrode using a 
modified Hagen cell (Figure 3B). 

These reduction potentials are consistent with those measured for natural 
cytochromes and heme-containing synthetic peptides, that cover the range of -475 
5 to +310 mV. The properties of the rop heme binding protein are at present being 
optimised with the creation of a cavity that accommodates the heme macrocycle 
with a unique orientation, with the possibility of tuning the redox potential. It has 
been demonstrated that the redox potential of cytochrome b562 can be varied by 
over 100 mV by changing the aminp acids in the vicinity of the heme and Dutton 
10 and coworkers demonstrated that by utilizing alternative porphyrins it was possible 
to modulate the redox potential of their designed heme protein by as much as 225 
mV. It is anticipated that the redox properties of this designed protein could 
likewise be "tuned" as required for particular applications. 

15 Example 3: Stabilitv of rop 

The protein of this invention shows a high degree of stability which may be 
measured as the unfolding free energy when a denaturant is added to the protein, 
of AGobs "2° is greater than or equal to y wherein y^3.0 kcal/mol. 
The stability test refers to the chemical stability. This was measured in solution by 

2 0 detecting changes of the circular dichroism signal at 222 nm and changes of 

fluorescence anisotropy signal of the protein upon additions of guanidinium 
hydrochloride. Experiments were carried out by addition to the protein of 
denaturant (guanidinium hydrochloride) to reach final concentrations from zero to 
6.5 M. The parameter that is related to the stability is the AG of unfolding in water 
25 (AG^V). . 

The data derived from the CD and fluorescence anisotropy were analysed 
assuming a two state model, fp + fy = 1, where fp and f^ represent the fraction of 
total protein in the folded and unfolded state respectively. The equilibrium constant 
Ku and the free-energy change AG^bs^ for the unfolding reaction were calculated 

3 0 from: 

Ku = fu/(1-fu) = fu/fF 
Gobs'" = - RT In Ku 

According to the linear extrapolation method, the unfolding free energy depend 
lineariy from the denaturant concentration (Shellman, J. A.. 1987): 
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Gobs'' = G,,3V-mMGuHCI] 
The results are shown in table 3. 

Table 3: Thermodynamic parameters characterizing the unfolding process 

5 



Sample 


DG1 

(kcal/mol) 


(kcal/mol M) 


DG2 
(kcal/mol) 


m2 

(kcal/mol M) 


S55 


8.7+0.5 


2.4±0.3 






JW2 


8.7±0.8 


1.9±0.5 






QC7.2 


3.1±0.5 


2.9±0.2 


3.9±0.5 


1.1±0.3 


RDM14.5 


3.5±0.6 


3.0±0.9 


4.0±0.5 


1.0±0.2 



The mutant Jw2, the parent from which all of the mutants containing the 
haem have been obtained, shows a AG of unfolding of 8.7 kcal/mol, as also 
shown for the original monomeric rop S55, 8.7 kcal/mol. 

15 A two-step transition model was necessary to adequately fit the data of 

RDM14.5 and QC7.2. resulting in a total free energy change of 7.5±1.1 kcal/mol 
and 7.0±1.0 kcal/mol respectively (see table 3). The unfolding of the starting 
construct proteins S55 and JW2 have been also reported for comparison in table. 
In these cases the unfolding process was satisfactorily represented by a simple 

2 0 two-state transition with similar free energy changes (table 3). These findings 

suggested that the haem bound proteins undergo a different unfolding mechanism 
due to the presence of the haem group. This hypothesis has been tested 
measuring the changes of QC7.2 and RDM14.5 absorption spectmm in the visible 
region, as a function of guanidinium hydrochloride (GdHCI) concentration. Most of 

25 the signal at 415 nm is already lost at 3 M GdHCI. confimning the loosening of the 
haem group. 



30 
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